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BACKGROUND OF THE INVENTION 
The present invention is directed to the processing of acoustic signals, and more 
particularly, but not exclusively, relates to techniques to extract an acoustic signal from a 
25 selected source while suppressing interference from other sources using two or more 
microphones. 

The difficulty of extracting a desired signal in the presence of interfering signals is 
a long-standing problem confronted by acoustic engineers. This problem impacts the 
design and construction of many kinds of devices such as systems for voice recognition 
30 and intelligence gathering. EspeciaUy troublesome is the separation of desired sound 
from unwanted sound with hearing aid devices. Generally, hearing aid devices do not 
permit selective amplification of a desired sound when contaminated by noise from a 
nearby source. This problem is even more severe when the desired sound is a speech 
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signal and the neaiby noise is also a speech signal produced by other talkers. As used 
herein, "noise" refers not only to random or nondeterministic signals, but also to 
undesired signals and signals interfering with the perception of a desired signal. 
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SUMMARY OF THE INVENTION 
One form of the present invention includes a unique signal processing technique 
using two or more microphones. Other forms include unique devices and methods for 
processing acoustic signals. 
5 Further embodiments, objects, features, aspects, benefits, forms, and advantages of 

the present invention shall become apparent from the detailed drawings and descriptions 
provided herein. 
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BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 is a diagrammatic view of a signal processing system. 
FIG. 2 is a diagram further depicting selected aspects of the system of FIG. 1. 
FIG. 3 is a flow chart of a routine for operating the system of FIG. 1. 
5 FIGs. 4 and 5 depict other embodiments of the present invention corresponding to 

hearing aid and computer voice recognition applications of the system of FIG. 1, 
respectively. 

FIG. 6 is a diagrammatic view of an experimental setup of the system of FIG. 1. 

FIG. 7 is a graph of magnitude versus time of a target speech signal and two 
10 interfering speech signals. 

FIG. 8 is a graph of magnitude versus time of a composite of the speech signals of 
FIG. 7 before processing, an extracted signal corresponding to the target speech signal of 
FIG. 7, and a duplicate of the target speech signal of FIG. 7 for comparison. 

FIG. 9 is a graph providing line plots for regularization factor (M) values of 1.001, 
15 1.005, 1.01, and 1.03 in terms of beamwidth versus frequency. 

FIG. 10 is a flowchart of a procedure that can be performed with the system of 
FIG. 1 either with or without the routine of FIG 3. 

FIGs. 11 and 12 are graphs illustrating the efficacy of the procedure of FIG. 10. 

20 
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DESCRIPTION OF SEUECTED EMBODIMENTS 
While the present invention can take many different forms, for the purpose of 
promoting an understanding of the principles of the invention, reference will now be 
made to the embodiments illustrated in the drawings and specific language will be used 
S to describe the same. It will nevertheless be understood that no limitation of the scope of 
the invention is thereby intended. Any alterations and further modifications of the 
described embodiments, and any further applications of the principles of the invention as 
described herein are contemplated as would normally occur to one skilled in the art to 
which the invention relates. 

10 FIG. 1 illustrates an acoustic signal processing system 10 of one 

embodiment of the present invention. System 10 is configured to extract a desired 
acoustic excitation from acoustic source 12 in the presence of interference or noise 
from other sources, such as acoustic sources 14, 16. System 10 includes acoustic 
sensor array 20. For the example illustrated, sensor array 20 includes a pair of 

15 acoustic sensors 22, 24 within the reception range of sources 12, 14, 16. Acoustic 
sensors 22, 24 are arranged to detect acoustic excitation from sources 12, 14, 16, 

Sensors 22, 24 are separated by distance D as illustrated by the like labeled 
line segment along lateral axis T. Lateral axis T is perpendicular to azimuthal axis 
AZ. Midpoint M represents the halfway point along distance D firom sensor 22 to 
. 20 sensor 24, Axis AZ intersects midpoint M and acoustic source 12, Axis AZ is 
designated as a point of reference (zero degrees) for sources 12, 14, 16 in the 
azimuthal plane and for sensors 22, 24. For the depicted embodiment, sources 14, 
16 define azimuthal angles 14a, 16a relative to axis AZ of about +22° and -65°, 
respectively. Correspondingly, acoustic source 12 is at 0° relative to axis AZ. In 

25 one mode of operation of system 10, the "on axis" alignment of acoustic source 12 
with axis AZ selects it as a desired or target source of acoustic excitation to be 
monitored with system 10. In contrast, the "off-axis" sources 14, 16 are treated as 
noise and suppressed by system 10, which is explained in more detail hereinafter. 
To adjust the direction being monitored, sensors 22, 24 can be moved to change 

30 the position of axis AZ. In an additional or alternative operating mode, the 

designated monitoring direction can be adjusted by changing a direction indicator 

5 
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incorporated in the routine of FIG. 3 as more fully described below. For these 
operating modes, it should be understood that neither sensor 22 nor 24 needs to be 
moved to change the designated monitoring direction, and the designated 
monitoring direction need not be coincident with axis AZ. 
S In one embodiment, sensors 22, 24 are omnidirectional dynamic 

microphones. In other embodiments, a different type of microphone, such as 
cardioid or hypercardioid variety could be utilized, or such different sensor type 
can be utilized as would occur to one skilled in the art. Also, in alternative 
embodiments more or fewer acoustic sources at different azimuths may be present; 
10 where the illustrated number and arrangement of sources 12, 14, 16 is provided as 

« 

merely one of many examples. In one such example, a room with several groups 
of individuals engaged in simultaneous conversation may provide a number of the 
sources. 

Sensors 22, 24 are operatively coupled to processing subsystem 30 to 
15 process signals received therefrom. For the convenience of description, sensors 
22, 24 are designated as belonging to left channel L and right channel R, 
respectively. Further, the analog time domain signals provided by sensors 22, 24 
to processing subsystem 30 are designated xiXt) and jcr(0 for the respective 
channels L and R. Processing subsystem 30 is operable to provide an output signal 
that suppresses interference from sources 14, 16 in favor of acoustic excitation 
detected from the selected acoustic source 12 positioned along axis AZ. This 
output signal is provided to output device 90 for presentation to a user in the form 
of an audible or visual signal which can be further processed. 

Referring additionally to FIG. 2, a diagram is provided that depicts other 
details of system 10. Processing subsystem 30 includes signal conditioner/filters 
32a and 32b to filter and condition input signals xiXf) and jcr(0 from sensors 22, 24; 
where t represents time. After signal conditioner/filter 32a and 32b, the 
conditioned signals are input to corresponding Analog-to-Digital iA/D) converters 
34a, 34b to provide discrete signals jcl(z) and xr(z), for channels L and R, 
respectively; where z indexes discrete sampling events. The sampling rate^ is 
selected to provide desired fidelity for a frequency range of interest. Processing 
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subsystem 30 also includes digital circuitry 40 comprising processor 42 and 
memory 50. Discrete signals xjXz) and xr{z) are stored in sample buffer 52 of 
memory 50 in a First-In-First-Out (FIFO) fashion. 

Processor 42 can be a software or firmware progranamable device, a state 
S logic machine, or a combination of both programmable and dedicated hardware. 
Furthermore, processor 42 can be comprised of one or more components and can 
include one or more Central Processing Units (CPUs). In one embodiment, 
processor 42 is in the form of a digitally programmable, highly integrated 
semiconductor chip particularly suited for signal processing. In other 

10 embodiments, processor 42 may be of a general purpose type or other arrangement 
as would occur to those skilled in the art. 

Likewise, memory 50 can be variously configured as would occur to those 
skilled in the art. Memory 50 can include one or more types of solid-state 
electronic memory, magnetic memory, or optical memory of the volatile and/or 

15 nonvolatile variety. Furthermore, memory can be integral with one or more other 
components of processing subsystem 30 and/or comprised of one or more distinct 
components. 

Processing subsystem 30 can include any oscillators, control clocks, 
interfaces, signal conditioners, additional filters, limiters, converters, power 

20 supplies, communication ports, or other types of components as would occur to 
those skilled in the art to implement the present invention. In one embodiment, 
subsystem 30 is provided in the form of a single microelectronic device. 

Referring also to the flow chart of FIG. 3, routine 140 is illustrated. Digital 
circuitry 40 is configured to perform routine 140. Processor 42 executes logic to 

25 perform at least some the operations of routine 140. By way of nonlimiting 
example, this logic can be in the form of software programming instructions, 
hardware, firmware, or a combination of these. The logic can be partially or 
completely stored on memory 50 and/or provided with one or more other 
components or devices. By way of nonlimiting example, such logic can be 

30 provided to processing subsystem 30 in the form of signals that are carried by a 
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(1) 



where: 



W(Jfc) = 



5 Y(k) is the output signal in frequency domain form, Wtik) and Wr(Jc) are complex 
valued multipliers (weights) for each ftequency k corresponding to channels L and 
R, the superscript denotes the complex conjugate operation, and the superscript 
"H" denotes taking the Hermitian of a vector. For this approach, it is desired to 
determine an "optimal" set of weights Whik) and WrCK) to minimize variance of 

10 Y(k). Minimizing the variance generally causes cancellation of sources not aligned 
with the desired direction. For the mode of operation where the desired direction is 
along axis AZ, frequency components which do not originate from directly ahead 
of the array are attenuated because they are not consistent in phase across the left 
and right channels L, R, and therefore have a larger variance than a source directly 

1 S ahead. Minimizing the variance in this case is equivalent to minimizing the output 
power of off-axis sources, as related by the optimization goal of relationship (2) 
that follows: 



MmE{\nkf} (2) 

20 

where Y(k) is the output signal described in connection with relationship (1). In 
one form, the constraint requires that "on axis" acoustic signals from sources along 
the axis AZ be passed with unity gain as provided in relationship (3) that follows: 



25 e^W(A:) = l (3) 

Here e is a two element vector which corresponds to the desired direction. When 
this direction is coincident with axis AZ, sensors 22 and 24 generally receive the 
signal at the same time and amplitude, and thus, for source 12 of the illustrated 

9 
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embodiment, the vector e is real- valued with equal weighted elements - for 
instance e^=[ 0.5 0.5 ]. In contrast, if the selected acoustic source is not on axis 
AZ, then sensors 22, 24 can be moved to align axis AZ with it. 

Li an additional or alternative mode of operation, the elements of vector e 
5 can be selected to monitor along a desired direction that is not coincident with axis 
AZ. For such operating modes, vector e becomes complex- valued to represent the 
appropriate time/phase delays between sensors 22, 24 that correspond to acoustic 
excitation off axis AZ. Thus, vector e operates as the direction indicator 
previously described. Correspondingly, alternative embodiments can be arranged 

10 to select a desired acoustic excitation source by establishing a different geometric 
relationship relative to axis AZ. For instance, the direction for monitoring a 
desired source can be disposed at a nonzero azimuthal angle relative to axis AZ. 
Indeed, by changing vector e, the monitoring direction can be steered from one 
direction to another without moving either sensor 22, 24. Procedure 520 described 

15 in connection with the flowchart of FIG. 10 hereinafter provides an example of a 
localization/tracking routine that can be used in conjunction with routine 140 to 
steer vector e. 

For inputs X^Cfc) and X^ik) that generally correspond to stationary random 
processes (which is typical of speech signals over small periods of time), the 
20 following weight vector W(k) relationship (4) can be determined from 
relationships (2) and (3): 

W(it) = jf^^^"'^ (4) 

25 where e is the vector associated with the desired reception direction, R(k) is the 
correlation matrix for the frequency, W(fc) is the optimal weight vector for the 
frequency and the superscript "-1" denotes the matrix inverse. The derivation 
of this relationship is explained in connection with a general model of the present 
invention applicable to embodiments with more than two sensors 22, 24 in array 

30 20. 

10 
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The correlation matrix R(A;) can be estimated from sj)ectral data obtained 
via a number "F" of fast discrete Fourier transforms (FFTs) calculated over a 
relevant time interval. For the two channel L, R embodiment, the correlation 
matrix for the 1^^^ frequency, R(A;), is expressed by the following relationsliip (5): 



R(A:) = 



M ^ 

— X^/*(n,fc)X;(n,A:) 

^ n=l 



M ^ 



Xu(k) X^kj 
X^ik) X^ik) 



(5) 



where Xi is the FFT in the frequency buffer for the left channel L and Xr is the FFT 
10 in the frequency buffer for right channel R obtained from previously stored FFTs 
that were calculated from an earlier execution of stage 146; "n" is an index to the 
number ''F" of FFTs used for the calculation; and "Af * is a regularization 
parameter. The terms Xii(k)y XiAJc), XrAJc), and XrAJc) represent the weighted sums 
for purposes of compact expression. It should be appreciated that the elements of 
15 the R(^) matrix are nonlinear, and therefore Y{k) is a nonlinear function of the 
inputs. 

Accordingly, in stage 148 spectra X^k) and Xf(k) previously stored in buffer 
54 are read from memory 50 in a First-In-First-Out (FIFO) sequence. Routine 140 
then proceeds to stage 150. ii stage 150, multiplier weights Wl(A:), Wr(^) are 

20 applied to and X,(Jc), respectively, in accordance with the relationship (1) for 
each frequency k to provide the output spectra Y{1c). Routine 140 continues with 
stage 152 which performs an Inverse Fast Fourier Transform (IFFT) to change the 
Y(fe) FFT determined in stage 150 into a discrete time domain form designated y(z). 
Next, in stage 154, a Digital-to-Analog (D/A) conversion is performed with D/A 

25 converter 84 (FIG. 2) to provide an analog output signal y{i). It should be 

understood that correspondence between Y(k) FFTs and output sample y(j) can 

vary. In one embodiment, there is one Y(k) FFT output for every y(z), providing a 

one-to-one correspondence. In another embodiment, there may be one Y(Jc) FFT 

for every 16 output samples y{z) desired, in which case the extra samples can be 

11 
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obtained from available YQc) FFTs. In still other embodiments, a different 
correspondence may be established. 

After conversion to the continuous time domain form, signal y{() is input to 
signal conditioner/filter 86. Conditioner/filter 86 provides the conditioned signal 
5 to output device 90. As illustrated in HG. 2, output device 90 includes an 

amplifier 92 and audio output device 94. Device 94 may be a loudspeaker, hearing 
aid receiver output, or other device as would occur to those skilled in the art. It 
should be appreciated that system 10 processes a binaural input to produce an 
monaural output. In some embodiments, this output could be further processed to 

10 provide multiple outputs. In one hearing aid application example, two outputs are 
provided that deliver generally the same sound to each ear of a user. In another 
hearing aid application, the sound provided to each ear selectively differs in terms 
of intensity and/or timing to account for differences in the orientation of the sound 
source to each sensor 22, 24, improving sound perception. 

15 After stage 154, routine 140 continues with conditional 156. In many 

applications it may not be desirable to recalculate the elements of weight vector 
W(A:) for every Y(Jc), Accordingly, conditional 156 tests whether a desired time 
interval has passed since the last calculation of vector W(fc). If this time period has 
not lapsed, then control flows to stage 158 to shift buffers 52, 54 to process the 

20 next group of signals. From stage 158, processing loop 160 closes, returning to 
conditional 144. Provided conditional 144 remains true, stage 146 is repeated for 
the next group of samples of xz.(z) and xr{z) to determine the next pair of XiSJC) and 
Xr(Jc) FFTs for storage in buffer 54. Also, with each execution of processing loop 
160, stages 148, 150, 152, 154 are repeated to process previously stored and 

25 Xjik) FFTs to determine the next YQc) FFT and correspondingly generate a 

continuous y(f). In this manner buffers 52, 54 are periodically shifted in stage 158 
with each repetition of loop 160 until either routine 140 halts as tested by 
conditional 144 or the time period of conditional 156 has lapsed. 

If the test of conditional 156 is true, then routine 140 proceeds from the 

30 affirmative branch of conditional 156 to calculate the correlation matrix R(^) in 
accordance with relationship (5) in stage 162. From this new correlation matrix 

12 
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R(*), an updated vector W(k) is determined in accordance with relationship (4) in 
stage 164. From stage 164, update loop 170 continues with stage 158 previously 
described, and processing loop 160 is re-entered until routine 140 halts per 
conditional 144 or the time for another recalculation of vector W(k) arrives. 
5 Notably, the time period tested in conditional 156 may be measured in temis of the 
number of times loop 160 is repeated, the number of EFTs or samples generated 
between updates, and the like. Alternatively, the period between updates can be 
dynamically adjusted based on feedback from an opemtor or monitoring device 
(not shown). 

10 When routine 140 initially starts, earlier stored data is not generally 

available. Accordingly, appropriate seed values may be stored in buffers 52, 54 in 
support of initial processing. In other embodiments, a greater number of acoustic 
sensors can be included in array 20 and routine 140 can be adjusted accordingly. 
For this more general form, the output can be expressed by relationship (6) as 

15 follows: 



where the XQc) is a vector with an entry for each of "C number of input channels 
20 and the weight vector W(k) is of like dimension. Equation (6) is the same at 

equation (1) but the dimension of each vector is C instead of 2. The output power 
can be expressed by relationship (7) as follows: 



where the correlation matrix R(k) is square with "C x C" diniensions. The vector e 
is the steering vector describing the weights and delays associated with a desired 
monitoring direction and is of the form provided by relationships (8) and (9) that 
follow: 



Yik) = W^(;fc)X(ifc) 



(6) 



E[Y(ifc)^]= E[W(fc)^X(ik)X"(fe)W(ik)] = W(ifc)"R(fc) W(ik) 



(7) 



30 
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<2>= (27tD/s/(cN))(sin(e)) for fc= 0,1. • • N-1 (9) 
where C is the number of array elements, c is the speed of sound in meters per 
second, and 0 is the desired "look direction." Thus, vector e may be varied with 
frequency to change the desired monitoring direction or look-direction and 
5 correspondingly steer the array. With the same constraint regarding vector e as 
described by relationship (3), the problem can be summarized by relationship (10) 
as follows: 

Minimize{W(fc)^R(A:)W(ife)} 

W(*) ^^^^ 

such that e°W(ik)=l 

This problem can be solved using the method of Lagrange multipliers generally 
10 characterized by relationship (1 1) as follows: 



Minimize { CostFunction + X* Constraint} (1 1) 

W(it) 



15 where the cost function is the output power, and the constraint is as listed above for 
vector e. A general vector solution begins with the Lagrange multiplier function 
H(W) of relationship (12): 

H(W) = iw(A:)"R(it)W(it)+>i(e^W(;t) -l) (12) 
2 

where the factor of one half (}/£) is introduced to simplify later math. Taking the 
20 gradient of H(W) with respect to W(k), and setting this result equal to zero, 
relationship (13) results as follows: 

V^H( W) = R(it)W(ik) + eA = 0 (13) 

Also, relationship (14) follows: 

W(ifc) = -R(*)"'e;L (14) 
25 Using this result in the constraint equation relationships (15) and (16) that follow: 

e«[-R(;k)"'eAj=l (15) 



A = -t''R(/k)"'e]~V 

14 



(16) 
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and using relationship (14), the optimal weights are as set forth in relationship 
(17): 

W,,, = R(/:)-^e[e^R(A:)-^e]"' (17) 

Because the bracketed term is a scalar, relationship (4) has this term in the 
denominator, and thus is equivalent. 

Returning to the two variable case for the sake of clarity, relationship (5) 
may be expressed more compactly by absorbing the weighted sums into the terms 
Xa, X/r, Xri and Xrr> and then renaming them as components of the correlation 
matrix R(^) per relationship (18): 



R(^) = 



Its inverse may be expressed in relationship (19) as: 









^12 











(18) 



R(^)"' = 



R72 

-R 



-R 



12 



21 



R 



11 



*, 



(19) 



det(R(A;)) 

where detQ is the determinant operator. Jf the desired monitoring direction is 
perpendicular to die sensor array, e = [0.5 0.5]\ the numerator of relationship (4) 
may then be expressed by relationship (20) as: 



R22 




'0.5' 


* 1 _ 


R^i ^12 


« 0-5 




Rn . 


0.5 


det(R(A;)) 


- R21 


det(R(ik)) 



(20) 



Using the previous result, the denominator is expressed by relationship (21) as: 

1 



e''R(it)''e = [0.5 0.5]* 



(^11 +^22^^12'~^l)' 



^22 ^12 



^ 

det(R(A:)) 



0.5 



(21) 



det(R(A:)) 

Canceling out the common factor of the determinant, the simplified relationship 
(22) is completed as: 

1 



L '2 J 



(i?j, + /?22 " ^12 " ^21 ) L^ll ~ ^1 - 



^22 ^12 



(22) 
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It can also be expressed in terms of averages of the sums of correlations between 
the two channels in relationship (23) as: 



lX„(k)-X^(k) 



(23) 



(24) 



where wi(k) and w^k) are the desired weights for the left and right channels, 
respectively, for the frequency, and the components of the correlation matrix are 
now expressed by relationships (24) as: 

X^{k)=\:^x;{n,k)X,M) 
^ fi=i 

(fc) = k)X, (n, k) 

just as in relationship (5). Thus, after computing the averaged sums (which may be 
kept as running averages), computational load can be reduced for this two channel 
embodiment. 

In a further variation of routine 140, a modified approach can be utilized in 
applications where gain differences between sensors of array 20 are negligible. 
For this approach, an additional constraint is utilized. For a two-sensor 
arrangement with a fixed on-axis steering direction and negligible inter-sensor gain 
differences, the desired weights satisfy relationship (25) as follows: 



Re[wJ = Re[w2] = J- 



(25) 



The variance minimization goal and unity gain constraint for this alternative 
approach correspond to the following relationships (26) and (27), respectively: 

(26) 



MinE[\Y,f ] 



H 



^ + Im[w2] 



= 1 



(27) 
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By inspection, when = [ 1 1 ], relationship (27) reduces to relationship (28) as 
follows: 

Im[wi ] = - Im[w2 ] (28) 

Solving for desired weights subject to the constraint in relationship (27) and using 
relationship (28) results in the following relationship (29): 



Im[^l2] 
-Im[/?i2l 



(29) 



The weights determined in accordance with relationship (29) can be used in 
place of those determined with relationships (22), (23), and (24); where R\\^ R12, 
R2U R22y ^ the same as those described in connection with relationship (18). 

10 Under appropriate conditions, this substitution typically provides comparable 
results with more efficient computation. When relationship (29) is utilized, it is 
generally desirable for the target speech or other acoustic signal to originate from 
the on-axis direction and for the sensors to be matched to one another or to 
otherwise compensate for inter-sensor differences in gain. Alternatively, 

15 localization information about sources of interest in each frequency band can be 
utilized to steer sensor array 20 in conjunction with the relationship (29) approach. 
This information can be provided in accordance with procedure 520 more fully 
described hereinafter in coimection with the flowchart of FIG. 10. 

Referring to relationship (5), regularization factor M typically is slightly 

20 greater than 1.00 to limit the magnitude of the weights in the event that the 
correlation matrix R(fc) is, or is close to being, singular, and therefore 
noninvertable. This occurs, for example, when time-domain input signals are 
exactly the same for F consecutive PFT calculations. It has been found that this 
form of regularization also can improve the perceived sound quality by reducing or 

25 eliminating processing artifacts common to time-domain beamformers. 

In one embodiment, regularization factor M is a constant. In other 

embodiments, regularization factor M can be used to adjust or otherwise control 

the array beamwidth, or the angular range at which a sound of a particular 

frequency can impinge on the array relative to axis AZ and be processed by routine 

30 140 without significant attenuation. This beamwidth is typically larger at lower 
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15 



20 



frequencies than higher frequencies, and can be expressed by the following 
relationship (30): 



r=l-M, where M is the regulaiization factor, as in relationship (5), c represents the 
speed of sound in meters per second (na/s), /represents frequency in Hertz (Hz), D 
is the distance between microphones in meters (m). For relationship (30), 
Beamwidth-jdB defines a beamwidth that attenuates the signal of interest by a 
relative amount less than or equal to three decibels (dB). It should be understood 
that a different attenuation threshold can be selected to define beamwidth in other 
embodiments of the present invention. FIG. 9 provides a graph of four Unes of 
different pattems to represent constant values 1.001, 1.005, 1.01, and 1.03, of 
regularization factor M, respectively, in terms of beamwidth versus frequency. 

Per relationship (30), as frequency increases, beamwidth decreases; and as 
regularization factor M increases, the beamwidth increases. Accordingly, in one 
alternative embodiment of routine 140, regularization factor M is increased as a 
function of frequency to provide a more uniform beamwidth across a desired range 
of frequencies. In another embodiment of routine 140, M is altematively or 
additionally varied as a function of time. For example, if little interference is 
present in the input signals in certain frequency bands, the regularization factor M 
can be increased in those bands. It has been found that beamwidth increases in 
frequency bands with low or no inference commonly provide a better subjective 
sound quality by limiting the magnitude of the weights used in relationships (22), 
(23), and/or (29). In a further variation, this improvement can be complemented 
by decreasing regularization factor M for frequency bands that contain 
interference above a selected threshold. It has been found that such decreases 
conunonly provide more accurate filtering, and better cancellation of interference. 
In still another embodiment, regularization factor M varies in accordance with an 
adaptive function based on frequency-band-specific interference. Li yet further 




V 



(30) 



18 



wo 01/87011 PCT/USOl/15047 

embodiments, regularization factor M varies in accordance with one or more other 
relationships as would occur to those skilled in the art. 

Referring to FIG, 4, one application of the various embodiments of the 
present invention is depicted as hearing aid system 210; where like reference 
5 numerals refer to like features. In one embodiment, system 210 includes 

eyeglasses G and acoustic sensors 22 and 24. Acoustic sensors 22 and 24 are fixed 
to eyeglasses G in this embodiment and spaced apart from one another, and are 
operatively coupled to processor 30. Processor 30 is operatively coupled to output 
device 190. Output device 190 is in the form of a hearing aid earphone and is 
10 positioned in ear E of the user to provide a corresponding audio signal. For system 
210, processor 30 is configured to perform routine 140 or its variants with the 
output signal y(t) being provided to output device 190 instead of output device 90 
of FIG. 2. As previously discussed, an additional output device 190 can be 
coupled to processor 30 to provide sound to another ear (not shown). This 
15 arrangement defines axis AZ to be perpendicular to the view plane of FIG. 4 as 
designated by the like labeled cross-hairs located generally midway between 
sensors 22 and 24. 

In operation, the user wearing eyeglasses G can selectively receive an 
acoustic signal by aligning the corresponding source with a designated direction, 
20 such as axis AZ. As a result, sources from other directions are attenuated. 

Moreover, the wearer may select a different signal by realigning axis AZ with 
another desired sound source and correspondingly suppress a different set of off- 
axis sources. Alternatively or additionally, system 210 can be configured to 
operate with a reception direction that is not coincident with axis AZ. 
25 Processor 30 and output device 190 may be separate units (as depicted) or 

included in a conmion unit worn in the ear. The coupling between processor 30 
and output device 190 may be an electrical cable or a wireless transmission. In one 
alternative embodiment, sensors 22, 24 and processor 30 are remotely located 
relative to each other and are configured to broadcast to one or more output 
devices 190 situated in the ear E via a radio frequency transmission. 
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In a further hearing aid embodiment, sensors 22, 24 are sized and shaped to 
fit in the ear of a listener, and the processor algorithms are adjusted to account for 
shadowing caused by the head, torso, and pinnae. This adjustment may be 
provided by deriving a Head-Related-Transfer-Function (HRTF) specific to the 
5 listener or from a population average using techniques known to those skilled in 
the art This function is then used to provide appropriate weightings of the output 
signals that compensate for shadowing. 

Another hearing aid system embodiment is based on a cochlear implant. A 
cochlear implant is typically disposed in a middle ear passage of a user and is 

10 configured to provide electrical stimulation signals along the middle ear in a 

standard manner. The implant can include some or all of processing subsystem 30 
to operate in accordance with the teachings of the present invention. Altematively 
or additionally, one or more external modules include some or all of subsystem 30. 
Typically a sensor array associated with a hearing aid system based on a cochlear 

15 implant is worn externally, being arranged to communicate with the implant 
through wires, cables, and/or by using a wireless technique. 

Besides various forms of hearing aids, the present invention can be applied 
in other configurations. For instance, FIG. 5 shows a voice input device 310 
employing the present invention as a front end speech enhancement device for a 

20 voice recognition routine for personal computer C; where like reference niunerals 
refer to hke features. Device 310 includes acoustic sensors 22, 24 spaced apart 
from each other in a predetermined relationship. Sensors 22, 24 are operatively 
coupled to processor 330 within computer C. Processor 330 provides an output 
signal for internal use or responsive reply via speakers 394a, 394b and/or visual 

25 display 396; and is arranged to process vocal inputs from sensors 22, 24 in 

accordance with routine 140 or its variants. In one mode of operation, a user of 
computer C aligns with a predetemiined axis to deliver voice inputs to device 310. 
In another mode of operation, device 310 changes its monitoring direction based 
on feedback from an operator and/or automatically selects a monitoring direction 

30 based on the location of the most intense sound source over a selected period of 
time. Altematively or additionally, the source localization/tracking ability 

20 
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provided by procedure 520 as illustrated in the flowchart of FIG. 10 can be 
utilized. In still another voice input application, the directionally selective speech 
processing features of the present invention are utilized to enhance performance of 
a hands-free telephone, audio surveillance device, or other audio system. 
5 Under certain circumstances, the directional orientation of a sensor array 

relative to the target acoustic source changes. Without accounting for such 
changes, attenuation of the target signal can result. This situation can arise, for 
example, when a binaural hearing aid wearer turns his or her head so that he or she 
is not aligned properly with the target source, and the hearing aid does not 

10 otherwise account for this misalignment It has been found that attenuation due to 
misalignment can be reduced by localizing and/or tracking one or more acoustic 
sources of interests. The flowchart of FIG. 10 illustrates procedure 520 to track 
and/or localize a desired acoustic source relative to a reference. Procedure 520 can 
be utilized for a hearing aid or in other apphcations such as a voice input device, a 

15 hands-free telephone, audio surveillance equipment, and the like — either in 

conjimction with or independent of previously described embodiments. Procedure 
520 is described as follows in terms of an implementation with system 10 of FIG. 
1. For this embodiment, processing system 30 can include logic to execute one or 
more stages and/or conditionals of procedure 520 as appropriate. In other 

20 embodiments, a different arrangement can be used to implement procedure 520 as 
would occur to one skilled in the art. 

Procedure 520 starts with A/D conversion in stage 522 in a manner like 
that described for stage 142 of routine 140. From stage 522, procedure 520 
continues with stage 524 to transform the digital data obtained from stage 522, 

25 such that "G" number of FFTs are provided each with "iV * number of FFT 
frequency bins. Stages 522 and 524 can be executed in an ongoing fashion, 
buffering the results periodically for later access by other operations of procedure 
520 in a parallel, pipelined, sequence-specific, or different manner as would occur 
to one skilled in the art. With the FFTs from stage 524, an array of localization 

30 results, P( Y)y can be described in terms of relationships (31)-(35) as follows: 
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k=0 n 



(31) 



V 



J 



Y = [-90°, - 89°. - 88°, , 89°, 90°] 

d{0^)^l, e^^Y mid 

|LU,fc)| + |lJ(g,A:)|^M,;,,(^) 

= 0, Qx^Y 

\xig,kj[>\ or 

|L(«,fc)| + |i?(g,fc)|<M,;^(A:) 

0^ = ROUND(sin"^(x(g,ifc)) ) 



(32) 



(33) 



(34) 



^(g»*^) = 



{ZLig, k) - ZRig. k) ± 2mi) (35) 



where the operator "INT" returns the integer part of its operand, L(jg,K) and Rig,k) 
are the firequency-domain data from channels L and R, respectively, for the 1^ EFT 
frequency bin of the FPT, MthrQO is a threshold value for the frequency-domain 
data in FFT frequency bin kj the operator 'HOUND" returns the nearest integer 

10 degree of its operand, c is the speed of sound in meters per second,^ is the 

sampling rate in Hertz, and D is the distance (in meters) between the two sensors 
of array 20. For these relationships, array P(y) is defined with 181 azimuth 
location elements, which correspond to directions —90° to +90° in 1° increments. In 
other embodiments, a different resolution and/or location indication technique can 

15 be used. 

From stage 524, procedure 520 continues with index initialization stage 526 
in which index g to the G numb^ of FFTs and index k to the N frequency bins of 
each FFT are set to one and zero, (g=l, fc=0), respectively. From stage 526, 
procedure 520 continues by entering frequency bin processing loop 530 and FFT 
20 processing loop 540. For this example, loop 530 is nested within loop 540. Loops 
530 and 540 begin with stage 532. 
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For an off-axis acoustic source, the corresponding signal travels different 
distances to reach each of the sensors 22, 24 of array 20. Generally, these different 
distances cause a phase difference between channels L and R at some frequency. 
In stage 532, routine 520 determines the difference in phase between channels L 
5 and R for the current frequency bin of the FFT g, converts the phase difference to 
a difference in distance, and determines the ratio x(g,k) of this distance difference 
to the sensor spacing D in accordance with relationship (35). Ratio x(g,k) is used 
to find the signal angle of arrival ftp, rounded to the nearest degree, in accordance 
with relationship (34). 

10 Conditional 534 is next encountered to test whether the signal energy level 

in chaimels L and R have more energy than a threshold level Moir and the value of 
x(gjc) was one for which a valid angle of arrival could be calculated. If both 
conditions are met, then in stage 535 a value of one is added to the corresponding 
element of P(y\ where y= Ox- Procedure 520 proceeds from stage 535 to 

15 conditional 536. Jf neither condition of conditional 534 is met, then P(y) is not 

modified, and procedure 520 bypasses stage 535, continuing with conditional 536. 

Conditional 536 tests if all the frequency bins have been processed, that is 
whether index k equals iV, the total number of bins. If not (conditional 536 test is 
negative), procedure 520 continues with stage 537 in which index k is incremented 

20 by one (k=k+l). From stage 537, loop 530 closes, returning to stage 532 to process 
the new g and k combination. If the conditional 536 test is affirmative, conditional 
542 is next encountered, which tests if all FFTs have been processed, that is 
whether index g equals G number of FFTs. If not (conditional 542 is negative), 
procedure 520 continues with stage 544 to increment g by one (g=g+l) and to reset 

25 k to zero (fc=0). From stage 544, loop 540 closes, returning to stage 532 to process 
the new g and k combination. If conditional test 542 is affirmative, then all N bins 
for each of the G number of FFTs have been processed, and loops 530 and 540 are 
exited. 

With the conclusion of processing by loops 530 and 540, the elements of 
30 array P(y) provide a measxire of the likelihood that an acoustic source corresponds 
to a given direction (azimuth in this case). By examining P(y)j an estimate of the 
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spatial distribution of acoustic sources at a given moment in time is obtained. 
From loops 530, 540, procedure 520 continues with stage 550. 

In stage 550, the elements of array P(y) having the greatest relative values, 
or "peaks," are identified in accordance with relationship (36) as follows: 
5 p(l) = PEAKS {P(r)^ Yiim . Pthr ) (36) 

where /?(/) is direction of the peak in the function P(y) for values of y between 
±y/im (a typical value for yum is 10^, but this may vary significantly) and for which 
the peak values are above the threshold value Ptttr- The PEAKS operation of 
relationship (36) can use a number of peak-finding algorithms to locate maxima of 

10 the data, including optionally smoothing the data and other operations. 

From stage 550, procedure 520 continues with stage 552 in which one or 
more peaks are selected. When tracking a source that was initially on-axis, the 
peak closest to the on-axis direction typically corresponds to the desired source. 
The selection of this closest peak can be performed in accordance with relationship 

15 (37) as follows: 

0^ = rmn\p{}l (37) 

where Otar is the direction angle of the chosen peak. Regardless of the selection 

criteria, procedure 520 proceeds to stage 554 to apply the selected peak or peaks. 

Procedure 520 continues from stage 554 to conditional 560. Conditional 560 tests 
20 whether procedure 520 is to continue or not. If the conditional 560 test is true, 

procedure 520 loops back to stage 522. If the conditional 560 test is false, 

procedure 520 halts. 

In an application relating to routine 140, the peak closest to axis AZ is 

selected, and utilized to steer array 20 by adjusting steering vector In this 
25 application, vector e is modified for each frequency bin k so that it corresponds to 

the closest peak direction Otar- For a steering direction of 0tar> tbe vector e can be 

represented by the following relationship (38), which is a simplified version of 

relationships (8) and (9): 
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e = [l e^'^f 



flTt'D f^ . . A (38) 



where k is the FFT frequency bin number, D is the distance in meters between 
sensors 22 and 24,^ is the sampling frequency in Hertz, c is the speed of sound in 
meters per second, JV is the nimiber of EFT frequency bins and dtar is obtained from 
5 relationship (37). For routine 140, the modified steering vector e of relationship 
(38) can be substituted into relationship (4) of routine 140 to extract a signal 
originating from direction Otar* Likewise, procedure 520 can be integrated with 
routine 140 to perform localization with the same FFT data. In other words, the 
A/D conversion of stage 142 can be used to provide digital data for subsequent 

10 processing by both routine 140 and procedure 520. Alternatively or additionally, 
some or all of the FFTs obtained for routine 140 can be used to provide the G FFTs 
for procedure 520. Moreover, beamwidth modifications can be combined with 
procedure 520 in various applications either with or without routine 140. In still 
other embodiments, the indexed execution of loops 530 and 540 can be at least 

15 partially performed in parallel with or without routine 140. 

In a further embodiment, one or more transformation techniques are 
utilized in addition to or as an alternative to fourier transforms in one or more 
forms of the invention previously described. One example is the wavelet 
transform, which mathematically breaks up the time-domain waveform into many 

20 simple waveforms, which may vary widely in shape. Typically wavelet basis 

functions are similarly shaped signals with logarithmically spaced frequencies. As 
frequency rises, the basis functions become shorter in time duration with the 
inverse of frequency. Like fourier transforms, wavelet transforms represent the 
processed signal with several different components that retain amplitude and phase 

25 information. Accordingly, routine 140 and/or routine 520 can be adapted to use 
such alternative or additional transformation techniques. In general, any signal 
transform components that provide amplitude and/or phase information about 
different parts of an input signal and have a corresponding inverse transformation 
can be applied in addition to or in place of FFTs. 
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15 



Routine 140 and the variations previously described generally adapt more 
quickly to signal changes than conventional time-domain iterative-adaptive 
schemes. In certain applications where the input signal changes rapidly over a 
small interval of time, it may be desired to be more responsive to such changes. 
For these applications, the F number of FFTs associated with correlation matrix 
Viik) may provide a more desirable result if it is not constant for all signals 
(alternatively designated the correlation length F). Generally, a smaller correlation 
length F is best for rapidly changing input signals, while a larger correlation length 
F is best for slowly changing input signals. 

A varying correlation length F can be implemented in a number of ways. 
In one example, filter weights are determined using different parts of the 
frequency-domain data stored in the correlation buffers. For buffer storage in the 
order of the time they are obtained (First-In, First-Out (FIFO) storage), the first 
half of the correlation buffer contains data obtained from the first half of the 
subject time interval and the second half of the buffer contains data from the 
second half of this time interval. Accordingly, the correlation matrices Ri(fc) and 
R2(fc) can be determined for each buffer half according to relationships (39) and 
(40) as follows: 



F 



F 




(39) 




Yx;in,k)Xt{Ti,k) 



4 ^X; (n,k)X ,(n,k) 



R2 ik) = 




2 



(40) 

BXk) can be obtained by summing correlation matrices Ri(A:) and R2(fc). 
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Using relationship (4) of routine 140, filter coefficients (weights) can be 
obtained using both Ri(k) and RiW- If the weights differ significantly for sonae 
frequency band k between RiCJt) and R2(fc), a significant change in signal statistics 
may be indicated. This change can be quantified by examining the change in one 
5 weight through determining the magnitude and phase change of the weight and 
then using these quantities in a function to select the appropriate correlation length 
F. The magnitude difference is defined according to relationship (41) as follows: 

AM(fe) = I Kx(A:)| -1^1,2(^)1 1 (41) 

where Wi4(fc) and v^ijzOO are the weights calculated for the left channel using Ri(fc) 
10 and R2(Jt), respectively. The angle difference is defined according to relationship 
(42) as follows: 

AA(k) = I min(ai - Zw^^ (^)» ^2 - ^^l2 «3 " ^'^l2 (^)) | 

ai=Zw^,(fe) ^^2) 

= Z>v^i(fc) — 27U 

where the factor of ±27c is introduced to provide the actual phase difference in the 
case of a ±27r jump in the phase of one of the angles. 
15 The correlation length F for some frequency bin k is now denoted as F(ky 

An example function is given by the following relationship (43): 

Fik) = max(Z;(fc) • AA{k) + d{k) • AM(fc) + c„,^(/:), c^/„ (*^)) (43) 

where Cmin(k) represents the minimum correlation length, c,naxih) represents the 
maximvun correlation length and b(Jc) and <i(fc) are negative constants, all for the k 

20 fi-equency band. Thus, as AA(A:) and AM(fc) increase, indicating a change in the 
data, the output of the function decreases. With proper choice of £>(fc) and d(fc), 
FQc) is limited between Cmm(k) and CmajJik), so that the correlation length can vary 
only within a predetermined range. It should also be understood that F(*) may take 
different forms, such as a nonlinear function or a function of other measures of the 

25 input signals. 

Values for function F(fe) are obtained for each frequency bin fc. It is 
possible that a small number of correlation lengths may be used, so in each 
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frequency bin k the correlation length that is closest to Fi(k) is used to form R(ky 
This closest value is found using relationship (44) as follows: 

= imn{\Fi(k) - c(i)\ ), c(i) = [c^i^,C2,c^y..>.,c^ax^ 

' (44) 

where imm, is the index for the imnimized function F(Jc) and c{i) is the set of 
5 possible correlation length values ranging from Cmin to Cmax- 

The adaptive correlation length process described in connection with 
relationships (39)-(44) can be incorporated into the correlation matrix stage 162 
and weight determination stage 164 for use in a hearing aid, such as that described 
in connection with FIG. 4, or other applications like surveillance equipment, voice 

10 recognition systems, and hands-free telephones, just to name a few. Logic of 
processing subsystem 30 can be adjusted as appropriate to provide for this 
incorporation. Optionally, the adaptive correlation length process can be utilized 
with the relationship (29) approach to weight computation, the dynamic beamwidth 
legularization factor variation described in connection with relationship (30) and 

15 FIG. 9, the localization/tracking procedure 520, alternative transformation 

embodiments, and/or such different embodiments or variations of routine 140 as 
would occur to one skilled in the art. The application of adaptive correlation 
length can be operator selected and/or automatically applied based on one or more 
measured parameters as would occur to those skilled in the art. 

20 Many other further embodiments of the present invention are envisioned. 

One further embodiment includes: detecting acoustic excitation with a number of 
acoustic sensors that provide a number of sensor signals; establishing a set of 
frequency components for each of the sensor signals; and determining an output 
signal representative of the acoustic excitation from a designated direction. This 

25 determination includes weighting the set of frequency components for each of the 
sensor signals to reduce variance of the output signal and provide a predefined gain 
of the acoustic excitation from the designated direction. 

In another embodiment, a hearing aid includes a nmnber of acoustic sensors 
in the presence of multiple acoustic sources that provide a corresponding number 

30 of sensor signals. A selected one of the acoustic sources is monitored. An output 
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signal representative of the selected one of the acoustic sources is generated. This 
output signal is a weighted combination of the sensor signals that is calculated to 
mininiize variance of the output signal. 

A still further embodiment includes: operating a voice input device 
5 including a number of acoustic sensors that provide a corresponding number of 
sensor signals; determining a set of frequency components for each of the sensor 
signals; and generating an output signal representative of acoustic excitation from a 
designated direction. This output signal is a weighted combination of the set of 
frequency components for each of the sensor signals calculated to minimize 

10 variance of the output signal. 

Yet a further embodiment includes an acoustic sensor array opemble to 
detect acoustic excitation that includes two or more acoustic sensors each operable 
to provide a respective one of a number of sensor signals. Also included is a 
processor to determine a set of frequency components for each of the sensor 

15 signals and generate an output signal representative of the acoustic excitation from 
a designated direction. This ou^ut signal is calculated from a weighted 
combination of the set of frequency components for each of the sensor signals to 
reduce variance of the output signal subject to a gain constraint for the acoustic 
excitation from the designated direction. 

20 A further embodiment includes: detecting acoustic excitation with a 

number of acoustic sensors that provide a corresponding number of signals; 
establishing a number of signal transform components for each of these signals; 
and determining an ou^ut signal representative of acoustic excitation from a 
designated direction. The signal transform components can be of the frequency 

25 domain type. Altematively or additionally, a determination of the output signal 
can include weighting the components to reduce variance of the output signal and 
provide a predefined gain of the acoustic excitation from the designated direction. 

In yet another embodiment, a hearing aid is operated that includes a number 
of acoustic sensors. These sensors provide a corresponding number of sensor 

30 signals. A direction is selected to monitor for acoustic excitation with the hearing 
aid. A set of signal transform components for each of the sensor signals is 
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determined and a number of weight values are calculated as a function of a 
correlation of these components, an adjustment factor, and the selected direction. 
The signal transform components are weighted with the weight values to provide 
an output signal representative of the acoustic excitation emanating from the 
5 direction. The adjustment factor can be directed to correlation length or a 
beamwidth control parameter just to name a few examples. 

For a further embodiment, a hearing aid is operated that includes a number 
of acoustic sensors to provide a corresponding nimiber of sensor signals. A set of 
signal transform components are provided for each of the sensor signals and a 

10 number of weight values are calculated as a function of a correlation of the 
transform components for each of a number of different frequencies. This 
calculation includes applying a first beamwidth control value for a first one of the 
frequencies and a second beamwidth control value for a second one of the 
frequencies that is different than the first value. The signal transform components 

15 are weighted with the weight values to provide an output signal. 

For another embodiment, acoustic sensors of the hearing aid provide 
corresponding signals that are represented by a plurality of signal transform 
components, A first set of weight values are calculated as a function of a first 
correlation of a first number of these components that correspond to a first 

20 correlation length. A second set of weight values are calculated as a function of a 
second correlation of a second number of these components that correspond to a 
second correlation length different than the first correlation length. An output 
signal is generated as a function of the first and second weight values. 

In another embodiment, acoustic excitation is detected with a number of 

25 sensors that provide a corresponding number of sensor signals. A set of signal 
transform components is determined for each of these signals. At least one 
acoustic source is localized as a function of the transform components. In one 
form of this embodiment, the location of one or more acoustic sources can be 
tracked relative to a reference. Alternatively or additionally, an output signal can 

30 be provided as a function of the location of the acoustic source determined by 
localization and/or tracking, and a correlation of the transform components. 
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« 

It is contemplated that various signal flow operators, converters, functional 
blocks, generators, units, stages, processes, and techniques may be altered, 
rearranged, substituted, deleted, duplicated, combined or added as would occur to 
those skilled in the art without departing from the spirit of the present inventions. 
5 It should be understood that the operations of any routine, procedure, or variant 
thereof can be executed in parallel, in a pipeline manner, in a specific sequence, as 
a combination of these appropriate to the interdependence of such operations on 
one another, or as would otherwise occur to those skilled in the art. By way of 
nonlimiting example, A/D conversion, D/A conversion, FFT generation, and FFT 

10 inversion can typically be performed as other operations are being executed. These 
other operations could be directed to processing of previously stored A/D or signal 
transform components, such as stages 150, 162, 164, 532, 535, 550, 552, and 554, 
just to name a few possibilities. In another nonlimiting example, the calculation of 
weights based on the current input signal can at least overlap the application of 

15 previously determined weights to a signal about to be output. All publications and 
patent applications cited in this specification are herein incorporated by reference 
as if each individual publication or patent application were specifically and 
individually indicated to be incorporated by reference. 
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EXPERIMENTAL SECTION 

The following experimental results provide nonlimiting examples, and 
should not be constmed to restrict the scope of the present invention. 
5 FIG, 6 illustrates the experimental set-up for testing the present invention. 

The algorithm has been tested with real recorded speech signals, played through 
loudspeakers at different spatial locations relative to the receiving microphones in 
an anechoic chamber. A pair of microphones 422, 424 (Sennheiser MKE 2-60) 
with an inter-microphone distance D of 15 cm, were situated in a listening room to 

10 serve as sensors 22, 24 . Various loudspeakers were placed at a distance of about 3 
feet from the midpoint M of the microphones 422, 424 corresponding to different 
azimuths. One loudspeaker was situated in front of the microphones that 
intersected axis AZ to broadcast a target speech signal (corresponding to source 12 
of FIG. 2). Several loudspeakers were used to broadcast words or sentences that 

15 interfere with the listening of target speech from different azimuths. 

Microphones 422, 424 were each operatively coupled to a Mic-to-Line 
preamp 432 (Shure FP-11). The output of each preamp 432 was provided to a dual 
channel volume control 434 provided in the form of an audio preampUfier (Adcom 
GTP-5511). The output of volume control 434 was fed into AID converters of a 

20 Digital Signal Processor (DSP) development board 440 provided by Texas 
Instruments (model number TI-C6201 DSP Evaluation Module (EVM)). 
Development board 440 includes a fixed-point DSP chip (model number 
TMS320C62) running at a clock speed of 133MHz with a peak throughput of 1064 
MIPS (naillions of instractions per second). This DSP executed software 

25 configured to implement routine 140 in real-time. The sampling frequency for 

these experiments was about 8 kHz with 16-bit A/D and D/A conversion. The FFT 
length was 256 samples, with an FFT calculated every 16 samples. The 
computation leading to the characterization and extraction of the desired signal was 
found to introduce a delay in a range of about 10-20 milliseconds between the 

30 input and output. 
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FIGs. 7 and 8 each depict traces of three acoustic signals of approximately 
the same energy. In FIG. 7, the target signal trace is shown between two 
interfering signals traces broadcast from azimuths 22° and -65°, respectively. 
These azimuths are depicted in FIG. 1. The target sound is a prerecorded voice 
5 from a female (second trace), and is emitted by the loudspeaker located near 0°. 
One interfering sound is provided by a female talker (top trace of FIG. 7) and the 
other interfering sound is provided by a male talker (bottom trace of FIG, 7). The 
phrase repeated by the corresponding talker is reproduced above the respective 
trace. 

10 Referring to FIG. 8, as revealed by the top trace, when the target speech 

sound is emitted in the presence of two interfering sources, its waveform (and 
power spectrum) is contaminated. This contaminated sound was difficult to 
understand for most listeners, especially those with hearing impairment. Routine 
140, as embodied in board 440, processed this contaminated signal with high 

15 fidelity and extracted the target signal by markedly suppressing the interfering 

sounds. Accordingly, intelligibility of the target signal was restored as illustrated 
by the second trace. The intelligibility was significantly improved and the 
extracted signal resembled the original target signal reproduced for comparative 
purposes as the bottom trace of FIG 8. 

20 These experiments demonstrate marked suppression of interfering sounds. 

The use of the regularization parameter (valued at approximately 1.03) effectively 
limited the magnitude of the calculated weights and results in an output with much 
less audible distortion when the target source is slightly off-axis, as would occur 
when the hearing aid wearer's head is slightly misaligned to the target talker. 

25 Miniaturization of this technology to a size suitable for hearing aids and other 
applications can be provided using techniques known to those skilled in the art. 

FIGS. 1 1 and 12 are computer generated image graphs of simulated results 
for procedure 520. These graphs plot localization results of azimuth in degrees 
versus time in seconds. The localization results are plotted as shading, where the 

30 darker the shading, the stronger the localization result at that angle and time. Such 
simulations are accepted by those skilled in the art to indicate efficacy of this type 
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of procedure. 

FIG. 11 illustrates the localization results when the target acoustic source is 
generally stationary with a direction of about 10** off-axis. The actual direction of 
the target is indicated by a solid black line. FIG. 12 illustrates the localization 
5 results for a target with a direction that is changing sinusoidally between +10** and 
-10**, as might be the case for a hearing aid wearer shaking his or her head. The 
actual location of the source is again indicated by a solid black line. The 
localization technique of procedure 520 accurately indicates the location of the 
target source in both cases because the darker shading matches closely to the actual 

10 location lines. Because the target source is not always producing a signal free of 
interference overlap, localization results may be strong only at certain times. In 
FIG. 12, these stronger intervals can be noted at about 0,2, 0.7, 0.9, 1.25, 1.7, and 
2.0 seconds. It should be understood that the target location can be readily 
estimated between such times. 

15 Experiments described herein are simply for the purpose of demonstrating 

operation of one form of a processing system of the present invention. The 
equipment, the speech materials, the talker configurations, and/or the parameters 
can be varied as would occur to those skilled in the art. 

Any theory, mechanism of operation, proof, or finding stated herein is 

20 meant to further enhance understanding of the present invention and is not 

intended to make the present invention in any way dependent upon such theory, 
mechanism of operation, proof, or finding. While the invention has been 
illustrated and described in detail in the drawings and foregoing description, the 
same is to be considered as illustrative and not restrictive in character, it being 

25 understood that only the selected embodiments have been shown and described and 
that all changes, modifications and equivalents that come within the spirit of the 
invention as defined herein or by the following claims are desired to be protected. 
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What is claimed is: 
1 . A method, comprising: 
detecting acoustic excitation with a number of acoustic sensors, the 
acoustic sensors providing a corresponding number of sensor signals; 
S establishing a number of frequency domain components for each of the 

sensor signals; and 

determining an output signal representative of the acoustic excitation from 
a designated direction, said determining including weighting the components for 
each of the sensor signals to reduce variance of the output signal and provide a 
10 predefined gain of the acoustic excitation from the designated direction. 

2. The method of claim 1, wherein said determining includes 
minimizing the variance of the output signal and the predefined gain is 
approximately unity. 

3. The method of claim 1 , further comprising changing the designated 
IS direction without moving any of the acoustic sensors and repeating said 

establishing and said determining after said changing. 

4. The method of claim 1, further comprising changing from the 
designated direction by moving one or more of the acoustic sensors and repeating 
said establishing and said determining after said changing. 

20 5. The method of claim 1, wherein said components correspond to 

fourier transforms and said weighting includes calculating a number of weights to 
minimize the variance of the output signal subject to a constraint that the 
predefined gain be generally maintained at unity, the weights being determined as 
a function of a frequency domain correlation matrix and a vector corresponding to 

25 the designated direction. 

6. The method of claim 5, furth^ comprising recalculating the weights 
from time to time and repeating said establishing and said determining on an 
established basis. 

7. The method of claim 1, further comprising calculating said weights 
30 subject to a constraint of an insubstantial level of gain difference between the 

acoustic sensors. 
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8. The method of claim 1, further comprising adjusting a correlation 
factor to control beamwidth as a function of frequency. 

9. The method of claim 1, further comprising calculating a number of 
correlation matrices and adaptively changing correlation length for one or more of 

S the correlation matrices relative to at least one oth^ of the correlation matrices. 

10. The method of claim 1, further comprising tracking location of at 
least one acoustic signal source as a function of a phase difference between the 
acoustic sensors. 

11. The method of any of claims 1-10, further comprising providing a 
10 hearing aid with the acoustic sensors and a processor operable to perform said 

estabUshing and said determining. 

12. The method of any of claims 1-10, wherein a voice input device 
includes the acoustic sensors and a processor operable to perform said establishing 
and said determining. 

15 13. A method, comprising: 

operating a hearing aid including a number of acoustic sensors in the 
presence of multiple acoustic sources, the acoustic sensors providing a 
corresponding number of sensor signals; 

monitoring a selected one of the acoustic sources; 
20 determining a set of frequency components for each of the sensor signals; 

and 

genemting an output signal representative of the selected one of the 
acoustic sources, the output signal being a weighted combination of the set of 
frequency components for each of the sensor signals calculated to minimize 
25 variance of the output signal. 

14. The method of claim 13, further comprising processing the output 
signal to provide at least one acoustic output to a user of the hearing aid. 

15. A method, comprising: 

operating a voice input device including a number of acoustic sensors, the 
30 acoustic sensors providing a corresponding number of sensor signals; 
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determining a set of frequency components for each of the sensor signals; 

and 

generating an output signal representative of acoustic excitation from a 
designated direction, the output signal being a weighted combination of the set of 
5 frequency components for each of the sensor signals calculated to minimize 
variance of the output signal. 

16. The method of claim 15, wherein the voice input device is included 
in a voice recognition system for a computer. 

17. The method of any of claims 13-16, wherein said generating 
10 includes calculating a number of weights as a function of a frequency domain 

correlation matrix and a vector corresponding to the designated direction. 

18. The method of claim 17, further comprising recalculating the 
weights from time to time. 

19. The method of claim 17, further comprising determining the 
15 weighted combination of the sensor signals as a function of a gain constraint 

associated with the designated direction. 

20. The method of claim 17, further comprising adjusting a correlation 
factor to control beamwidth as a function of frequency. 

21. The method of claim 17, further comprising adaptively changing 
20 correlation length. 

22. A method, comprising: 

operating a hearing aid including a number of acoustic sensors, the acoustic 
sensors providing a corresponding number of sensor signals; 

selecting a direction to monitor for acoustic excitation with the hearing aid; 
25 determining a set of signal transform components for each of the sensor 

signals; 

calculating a number of weight values as a function of a correlation of the 
signal transform components, an adjustment factor, and the direction; and 

weighting the signal transform components with the weight values to 
30 provide an output signal representative of the acoustic excitation emanating from • 
the direction. 
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23. The method of claim 22, wherein the transfomi components correspond to 
different frequencies and the adjustment factor has a first value for a first one of the 
frequencies and second value different than the first value for a second one of the 
frequencies to control beamwidth. 
5 24. The method of claim 22, wherein the adjustment factor corresponds to 

correlation length and further comprising determining a number of different correlations 
with correlation length adaptively changed in accordance with different values for the 
adjustment factor. 

25. The method of claim 22, further comprising: 
10 determining a level of interference; and 

adjusting the beamwidth of the hearing aid in response to the level of 
interference with the adjustment factor. 

26. The method of claim 22, further comprising: 

determining a rate of change of at least one frequency of at least one of the sensor 
15 signals with respect to time; and 

adjusting the correlation length in response to the rate of change with the 
adjustment factor. 

27. A method, comprising: 

operating a hearing aid including a number of acoustic sensors, the acoustic 
20 sensors providing a corresponding number of sensor signals; 

providing a set of signal transform components for each of the sensor 
signals; 

calculating a number of weight values as a function of a correlation of the 
transform components for each of a number different frequencies, said calculating 
including applying a first beamwidth control value for a first one of the frequencies 
and a second beamwidth control value for a second one of the frequencies different 
than the first beamwidth control value; and 

weighting the signal transform components with the weight values to 
provide an output signal. 
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28. The method of claim 27, further comprising selecting the first 
beamwidth value and the second beamwidth value to provide a generally constant 
beamwidth of the hearing aid over a predefined frequency range. 

29. The method of claim 27, wherein the first beamwidth value and the 
5 second beamwidth value differ in accordance with a difference in an amount of 

interference at the first one of the frequencies relative to the second one of the 
frequencies. 

30. A method, comprising: 

operating a hearing aid including a number of acoustic sensors, the acoustic 
10 sensors providing a corresponding number of sensor signals; 

providing a first plurality of signal transform components for the sensor 
signals; 

calculating a first set of weight values as a function of a first correlation of 
the first signal transform components corresponding to a first correlation length; 
IS providing a second plurality of signal transform components for the sensor 

signals; 

calculating a second set of weight values as a function of a second 
correlation of the second signal transform components corresponding to a second 
correlation length different that the first correlation length; and 
20 generating an output signal as a function of the first weight values and the 

second weight values. 

31. The method of claim 30, wherein the first correlation length and the 
second correlation length differ in accordance with a difference in rate of change of 
at least one fi'equency of at least one of the sensor signals with respect to time. 

25 32. The method of any of claims 22-31, wherein the number of sensors 

is two and the hearing aid has a single, monaiiral output. 

33. The method of any of claims 22-31, wherein said calculating is 
performed to minimize output variance. 

34. The method of any of claims 22-31, further comprising localizing a 
30 selected acoustic soxirce relative to a reference as a function of the transform 

components. 
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35, The method of any of claims 22-31, wherein the transform 
components are of a fourier type. 

36. A hearing aid system operable to perform the method of any of 
claims 22-31. 

5 37. A method comprising: 

detecting acoustic excitation with a number of acoustic sensors, the acoustic 
sensors providing a corresponding number of sensor signals; 

establishing a set of signal transform components for each of the sensor signals; 
tracking location of a somxe of the acoustic excitation relative to a reference as a 
10 function of the transform components; and 

providing an output signal as a function of the location and a correlation of 
the transform components. 

38. The method of claim 37, wherein the number of sensors is two and said 
tracking includes determining a phase difference between the sensor signals. 
15 39. The method of claim 37, wherein the reference is a designated axis 

and the location is provided in the form of an azimuthal direction. 

40. The method of claim 37, wherein said tracking includes generating 
an array with a number of elements each corresponding to a different azimuth and 
detecting one or more peak values among the elements of the array. 
20 41 . The method of claim 37, further comprising adjusting a beamwidth 

factor relative to frequency. 

42. The method of claim 37, further comprising calculating a number of 
different correlation matrices and adaptively changing correlation length of one or 
more of the matrices relative to at least one other of the matrices. 
25 43. The method of claim 37, further comprising steering a direction- 

indicating vector corresponding to the location. 

44. The method of claim 37, wherein said providing include generating 
the output signal by weighting the transform components to reduce variance of the 
output signal and provide a predefined gain. 
30 45. A device operable to perform the method of any of claims 37-44. 
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46. A hearing aid system operable to perfonn the method of any of 
claims 37-44. 

47. An apparatus, comprising: 

an acoustic sensor array operable to detect acoustic excitation, said acoustic 
5 sensor array including two or more acoustic sensors each operable to provide a 
respective one of a number of sensor signals; and 

a processor operable to determine a set of frequency components for each 
of said sensor signals and generate an output signal representative of the acoustic 
excitation from a designated direction, said output signal being calculated from a 
10 weighted combination of said set of frequency components for each of said sensor 
signals to reduce variance of said output signal subject to a gain constraint for the 
acoustic excitation from said designated direction. 

48. The apparatus of claim 47, wherein said processor is operable to 
calculate said weighted combination to generally minimize said variance of said 

15 ou^ut signal and generally maintain said gain at unity. 

49. The apparatus of claim 47, wherein said processor is operable to 
determine a number of signal weights as a function of a frequency domain 
correlation matrix and a vector corresponding to said designated direction. 

50. An apparatus, comprising: 

20 a £urst acoustic sensor operable to provide a first sensor signal; 

a second acoustic sensor operable to provide a second sensor signal; 
a processor operable to generate ain output signal representative of acoustic 
excitation detected with said first acoustic sensor and said second acoustic sensor 
from a designated direction, said processor including: 
25 means for transforming said first sensor signal to a first number of 

frequency domain transform components and said second sensor signal to a 
second number of frequency domain transform components, 

means for weighting said first transform components to provide a 
corresponding number of first weighted components and said second 
30 transform components to provide a corresponding number of second 
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weighted components as a function of variance of said output signal and a 
gain constraint for the acoustic excitation from said designated direction. 



5 



means for combining each of said first weighted components with a 
corresponding one of said second weighted components to provide a 
£requency domain form of said output signal; and 



means for providing a time domain form of said output signal from 
said frequency domain form. 



51 . The apparatus of any of claims 47-50, wherein said processor includes 



means for steering said designated direction. 



10 



52. The apparatus of any of claims 47-50, further comprising at least one 



acoustic output device responsive to said output signal. 

53. The apparatus of any of claims 47-50, wherein the apparatus is arranged as 
a hearing aid. 

54. The apparatus of any of claims 47-50, wherein the apparatus is arranged as 
15 a voice input device. 

55. The apparatus of any of claims 47-50, wherein said processor is operable to 
localize an acoustic excitation source relative to a reference. 

56. The apparatus of any of claims 47-50, wherein said processor is operable to 
track location of an acoustic excitation source relative to an azimuthal plane. 

20 57. The apparatus of any of claims 47-50, wherein said processor is operable to 

adjust a beamwidth control parameter with frequency. 

58. The apparatus of any of claims 47-50, wherein said processor is operable to 
calculate a number of different correlation matrices and adaptively adjust correlation 
length of one or more of the matrices relative to at least one other of the matrices. 
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